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Abstract 

To assess the classification accuracy of a continuous diagnostic result, the receiver 
operating characteristic (ROC) curve is commonly used in applications. The partial 
area under the ROC curve (pAUC) is one of widely accepted summary measures due 
to its generality and ease of probability interpretation. In the field of life science, a 
direct extension of the pAUC into the time-to-event setting can be used to measure the 
usefulness of a biomarker for disease detection over time. Without using a trapezoidal 
rule, we propose nonparametric estimators, which are easily computed and have closed- 
form expressions, for the time-dependent pAUC. The asymptotic Gaussian processes 
of the estimators are established and the estimated variance-covariance functions are 
provided, which are essential in the construction of confidence intervals. The finite 
sample performance of the proposed inference procedures are investigated through a 
series of simulations. Our method is further applied to evaluate the classification ability 
of CD4 cell counts on patient's survival time in the AIDS Clinical Trials Group (ACTG) 
175 study. In addition, the inferences can be generalized to compare the time-dependent 
pAUCs between patients received the prior antiretroviral therapy and those without it. 

1 Introduction 

Decision-making is an important issue in many fields such as signal detection, psychology, 
radiology, and medicine. For example, preoperative diagnostic tests are medically necessary 
and implemented in clinical preventive medicine to determine those patients for whom surgery 
is beneficial. For the sake of cost-saving or performance improvement, new diagnostic tests 
are often introduced and the classification accuracies of them are evaluated and compared 
with the existing ones. The ROC curve, a plot of the true positive rate (TPR) versus the 
false positive rate (FPR) for each possible cut point, has been widely used for this purpose 
when the considered diagnostic tests are continuous. One advantage of the ROC curve is that 
it describes the inherent classification capability of a biomarker without specifying a specific 
threshold. Moreover, the invariance characteristic of ROC curve in measurement scale provides 
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a suitable base to compare different biomarkers. Generally, the more the curve moves toward 
the point (0, 1), the better a biomarker performs. 

In many applications, the area under the ROC curve (AUC), one of the most popular sum- 
mary measures of the ROC curve, is used to evaluate the classification ability of a biomarker. 
It has the probability meaning that the considered biomarker of a randomly selected diseased 
case is greater than that of a non-diseased one. Generally, a perfect biomarker will have the 
AUC of one while a poor one takes a value close to 0.5. Since the AUC is the whole area 
under the ROC curve, relevant information might not be entirely captured in some cases. 
For example, two crossed ROC curves can have the same AUC but totally different perfor- 
mances. Furthermore, there might be limited or no data in the region of high FPR. In view 
of these drawbacks, it is more useful to see the pAUC within a certain range of TPR or FPR. 
To evaluate the performance of several biomarkers, McClish (1989) adopted the summary 
measure pAUC for the FPR over a practically relevant interval. On the other hand, Jian, 
Metz, Nishikawa (1996) showed that women with false-negative findings at mammography 
cannot be benefited from timely treatment of the cancer. Thus, these authors suggested using 
the pAUC with a portion of the true positive range in their applied data. Although their 
perspectives are different, the main frame is the same: only the practically acceptable area 
under the ROC curve is assessed. As mentioned by Dwyer (1997), the pAUC is a regional 
analysis of the ROC curve intermediate between the AUC and individual points on the ROC 
curve. The pAUC becomes a good measure of classification accuracy because it is easier for 
a practitioner to determine a range of TPR or/and FPR that are relevant. Several estima- 
tion and inference procedures have been proposed by Emir, Wieand, Jung, and Ying (2000), 
Zhang, Zhou, Freeman, and Freeman (2002), Dodd and Pepe (2003), among others. A more 
thorough understanding of the ROC, AUC, and pAUC can be also found in Zhou, McClish, 
and Obuchowski (2002). 

Recent research in ROC methodology has extended the binary disease status to the time- 
dependent setting. Let T denote the time to a specific disease or death and Y represent the 
continuous diagnostic marker measured before or onset of the study with joint survivor func- 
tion S(t, y) = P{T > t,Y > y). For each fixed time point t, the disease status can be defined as 
a case if T < t and a control otherwise. To evaluate the ability of Y in classifying subjects who 
is diseased before time t or not, Heagerty, Lumley, and Pepe (2000) generalized the traditional 
TPR and FPR to the time-dependent TPR and FPR as TPR t (y) = P(Y > y\T < t) and 
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FPRt(y) = P{Y > y\T > t), which can be further derived to be (S(0, y) - S(t, y))/{l - S T (t)) 
and S(t,y) / Srit) with S'r(t) = S(t, — oo). For the time-dependent AUC, Chambless and Diao 
(2006), Chiang, Wang, and Hung (2008), and Chiang and Hung (2008) proposed different 
nonparametric estimators and developed the corresponding inference procedures. As for the 
time-dependent pAUC, there is still far too little research on this topic. We propose non- 
parametric estimators, which are shown to converge weakly to Gaussian processes, and the 
estimators for the corresponding variance-covariance functions. The established properties 
facilitate us to make inference on the time-dependent pAUC and can be reasonably applied 
to the time-dependent AUC because it is a special case of this summary measure. 

The rest of this paper is organized as follows. In Section 2, the nonparametric estimation 
and inference procedures are proposed for the time- dependent pAUC. The finite sample prop- 
erties of the estimators and the performance of the constructed confidence bands are studied 
through Monte Carlo simulations in Section 3. Section 4 presents an application of our method 
to the ACTG 175 study. In this section, an extended inference procedure is further provided 
for the comparison of the time-dependent pAUCs. Some conclusions and future works are 
addressed in Section 5. Finally, the proof of main results is followed in the Appendix. 

2 Estimation and Inferences 

In this section, we estimate the time-dependent pAUC and develop the corresponding inference 
procedures. Without loss of generality, the time- dependent pAUC is discussed for restricted 
FPR t (y) because that for restricted TPR t (y) can be derived in the same way by reversing 
the roles of case and control subjects. 

2.1 Estimation 

Let X be the minimum of T and censoring time C, 8 = I(X = T) represent the censoring 
status, and q at = FPRi 1 (a) = mi{y : FPR t {y) < a}, a £ (0,1], denote the (1 - a)th 
quantile of Y conditioning on {T > t} at the fixed time point t. Following the expression 
{— J TPF t (y)d y FPR t (y)} for the time-dependent AUC, the time-dependent pAUC 9 t (q a t) 
with the FPRt(y) less than a is derived to be functional of S(t,y): 

o fc, _ -I(S(0,u)-S(t,u))I(u>q at )d u S(t,u) 

Bq(5) " s T (t)(i - s T (t)) • (2 - 1} 
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Note that the value of 9 t (q a t) for a perfect biomarker should be a while a useless one is 0.5a 2 . 
Same with the interpretation of Cai and Dodd (2008), the rescaled time-dependent pAUC 
9 t (q at )/a can be explained as the probability that the test result of a case {Tj < t} is higher 
than that of a control {Tj > t} with its value exceeding q at for i ^ j, i.e., P(Yi > Yj\Ti < 
t,Tj > t,Yj > q at ). 

From the formulation in (I2.ip . an estimator of t (q a t) can be obtained if S(t, y) is estimable. 
Under marker dependent censoring (T and C are independent conditioning on Y), Akritas 
(1994) suggested estimating S(t,y) by S(t,y) = n~ l Y^l=i ST{t\Yi)P\Yi > y), where 

Mt\v)= IT { 1- ^&W-S y (y)) } (2.2) 
{i-.x i <t,5 i =i} nS x {X t \y) 

is an estimator of Sr(t\y) = P(T > t\Y = y) with Sy{y) = rT 1 YTj=\ I(Yj > y) and Sx(t\y) = 
n- 1 Y% =1 1{Xj > ^KxiSyiYj) - S Y {y)) being estimators of S Y (y) = P{Y > y) and S x (t\y) = 
P(X > t\Y = y). Here, K\(u) = (2A)~ 1 /(|n| < A) and A is a nonnegative smoothing 
parameter. Substituting S(t,y) for S(t,y) in (12. ip . 6 t (q a t) is proposed to be estimated by 

? , Ba{§) = n-^(l : SHt\Y l )^ i m^) i ( , 3) 

S T (t)(l-S T (t)) 

where ^(y) = I{Y { > Y 3 > y), q at = FPR~\a), FPR t {y) = S(t,y)/S T {t), and S T (t) = 
S(t,—oo). In the Appendix, we show that \/n(6 t (q a t) — t {q at )) is uniformly approximated 
by rC 1 ! 2 Y^i=i ^ai{t) and converges weakly to a mean zero Gaussian process with variance- 
covariance function E a (s,t) = E[^/ ai (s)^ a i(t)} for t G (0,r] and P(X > r) > 0. The applica- 
tion of kernel function K\(u) provides the nearest neighbor estimator of S(t, y). An alternative 
choice of kernel function is possible and will lead to a different estimator of 6 t (q a t)- As men- 
tioned in Akritas (1994), the asymptotic properties of S(t,y) is irrelevant to the choice of 
kernel function under some regularity conditions and so is 6t(q a t)- The author further showed 
that any other estimator for S(t, y) is at least as dispersed as S(t, y) and the choice of A is 
irrelevant to the measurement scale of Y. It is not difficult to see that the estimation problem 
of 9 t (q a t) becomes that of S(t,y). From this perspective, the proposed estimation procedure 
can be extended to any censoring or truncation mechanisms provided that S(t, y) is estimable. 

Alternatively, one might be interested in making inference on the time-dependent pAUC 
(9t(q a 't) — 9t(q a t)) over the range [a, a'], < a < a' < 1, of FPR t (y). The estimator 
(9t(q a 't) — 9t(q a t)) is suggested and the limiting Gaussian process of y/n{(9t(q a 't) — 9t(q a t)) — 
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(Otila't) — @t(<lat))} is a direct consequence of the large sample property of t (q at ). When the 
complete failure time data {T it Y{\™ =1 are available, S(t, y) can be estimated by an empirical 
estimator S(t, y) = n^ 1 Yl7=i ^(-^ > t,Yi > y). A natural estimator for 6 t (q a t) is obtained as 



3",- , n-*Y. iH m<t,T 1 >t)<t> ti {^ l ) 

e,(q " ] ~ Mmi-sAt)) ' (2 ' 4) 

where q at = FPR t (a), FPR t {y) = S(t,y)/ Sx(t), and S'r(t) = S(t, — oo). By substituting 
the disease and disease- free groups for the time- varying case and control ones, 6 t (q a t) and the 
estimator 8 t (q a t) will reduce to the time-invariant pAUC and the nonparametric estimator of 
Dodd and Pepe (2003). By the similar argument as in the proof of the asymptotic Gaussian 
process of 9 t (q at ), it is straightforward to derive that y/n(6 t (q at ) —6 t (q at )) converges weekly to a 
Gaussian process with mean zero and variance-covariance function E* (s, t) = £'[^ r * i (s)^ r * i (f)] ) 
where 

^ , s = Uj(t, q«t) + y(t, q«t)V*(t, -oo) + (aS T (t) - S Y (q at ))(V*(t, q at ) - aV*(t, -oo)) 
aA) S T (t)(l-S T (t)) 

U*{t,y) = E[h* tJ (t,y) + h^y^Yi] - 2H{t,y), h* 3 (t,y) = I{T t < t,Tj > t)^(y), and 

V*(t,y)=I(T i >t,Y i >y). 

2.2 Inference Procedures on the Time-Dependent pAUC 

The confidence intervals for 6 t {q a t) and (0 t (q a r t ) —9 t (q a t)) can be constructed by the asymptotic 
Gaussian processes and the estimated variance-covariance functions. Replacing the parameters 
with their sample analogues, ^ a i(t) is proposed to be estimated by 

^ / , _ Ui(t,q Qt ) +y(t,qod)Vi(t,-oo) + (aS T (t) - S Y (qat))(Vi(t,q at ) - aVijt, -oo)) 
mU " S T (t)(l-S T (t)) 

where U^t, y) = n~ l Y.r,j .■ , } l J'uU ■ v) v)) ~ 2 %, y) + @ Y 0Q - S(t, y))&(t)I(Yi > y), 

Vi(t,y) = (S T {t\Y^+l(t))I(Yi > y) - S{t,y), and r){t,y) = H(t,y)(2S T (t) - l)/{S T (t) - 
S 2 T (t)) with ^-(t, j/) = (1 - STitlY^Srm^M^^y) = n^^h^y), g(t) = 
-S T (t\Yi) fiSx^ulYjdJdiiulYd, and M^Y) = I(X t < t)6 t + \nS T (t A X^). Thus, it 
is straightforward to have an estimated variance-covariance function 

1 n ~ 

E tt (M) = (2.6) 

i=l 

and a (1 — q), < q < 1, pointwise confidence interval for 9 t (q a t)- 

t (q*t)±^pX 1 J 2 (t,t), (2.7) 

'71 



where Z^/ 2 is the (1 — q/2) quantile value of the standard normal distribution. With the in- 
dependent and identically distributed representation n" 1 / 2 Y^i=i ^ai{t), the re-sampling tech- 
nique of Lin, Wei, Yang, and Ying (2000) is applied to determine a critical point L f so that 

P( sup i ^fe)-^)) , <Lq) = l _, (2 . 8) 
te[n,7a] T£ (t,t) 

for a subinterval [t±, r 2 ] of interest within the time period [0, r\. The validity of (12. 8p enables 
us to construct a (1 — q) simultaneous confidence band for {0 t (q at ) : t G [ti,t 2 ]} via 

{6 t (q at ) ± ^i /2 (t,t) : t G [71,72]}. (2.9) 

'77, 



Note that both pointwise and simultaneous confidence bands for (9t(q a >t) — 9t{<lat)) can 
be constructed as the above ones. When 8t(q a t) is applicable, the confidence bands are easily 
obtained by substituting S* (s, t) = ra" 1 Y%=i *ai( s )*ai(*) for ^^(s, t) in (EHJ) and §ZM), where 

Qat) + V(t, q a t)V*(t, -oo) + (a5r(*) - (^""(i, ?ot) - -oo)) 



S T (t)(l-S r (*)) 



U*(t,y)=n- 1 J2 {j .^ i} (h* j (t,y)+h* i (t,y))-2H^ £^ -h^t, y), and Jfti, y) = 
H(t,y)(2S T (t)-l)/(S T (t)-SUt)). 

3 Numerical Studies 

In this section, Monte Carlo simulations are conducted to investigate the finite sample proper- 
ties of the proposed estimators and the performance of the inference procedures. The continu- 
ous biomarker Y is designed to follow a standard normal distribution. Conditioning on Y = y, 
the failure time T and the censoring time C are independently generated from a lognormal 
distribution with parameters \x = —0.15?/ + In 10 and a = 0.3, and an exponential distribution 
with scale parameter 10b {2 1 (y < 0) + I(y > 0)}, where the constant b is set to produce the 
censoring rates of 0%, 30% and 50%. In our numerical studies, 500 data sets of 500 and 1000 
observations are simulated. The estimators and the pointwise confidence intervals of 9 t (q a t) 
are evaluated at the selected time points t 0A , t .5, and to.6 with a =0.1, 0.2, and 0.3, where 
t p is the pth quantile of the distribution of T. Moreover, the simultaneous confidence bands 
for 9 t (q at ) over the subintervals [to.4^0.5] an d [^0.4^0.6] are considered. Since a small portion 
of cases or controls occur outside [to.4;^o.6] under the above design, the simulation results are 
presented within this time period. 
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When survival times are subject to censoring, an appropriate smoothing parameter ur- 
gently becomes necessary in the estimation of 9 t {q a t)- It usually attempts to select a band- 
width that minimizes the asymptotic mean squared error of an estimator, which is obtained 
by using the plug-in method for unknown parameters. This approach, however, would lead 
to further bandwidth selection problems and is infeasible in our current setting. For the 
bandwidth selection, we propose a simple and easily implemented data-driven method. This 
procedure is to find a bandwidth, say, X opt which minimizes the following integrated squared 
error 



Jo 

where S e (u) is the Kaplan-Meier estimator computed based on the data {ej,<5j}™ =1 , = 
1 — (Xi\Yi), Sj, l \t\y) is computed as ST{t\y) with the ith observation (X i; 5i, Yi) deleted, 
and N e i(u) = 5il(ei < u). The rationale behind (13. ip is that {1 — SrpQl^), $i}iLi can 
be shown to be an independent censored sample from a uniform distribution U(0, 1) under 
the validity of conditionally independent censoring. For each generated sample, 9t(q a t) and 
^ai^'s are computed by using A op t and the subjective bandwidths of 0.01 and 0.2. Among 
the 500 simulated samples, the bandwidths obtained from minimizing ISE(X) in (13. ip have 
a range between 0.01 and 0.2, and medians of about 0.09, 0.1, 0.1, 0.07, 0.08, and 0.08 for 
(n,c.r.) of (500,0%), (500,30%), (500,50%), (1000,0%), (1000,30%), and (1000,50%), where 
n and c.r. represent the sample size and the censoring rate. 

Tables 1-3 summarize the averages and standard deviations of estimates, the standard er- 
rors, and the empirical coverage probabilities of 0.95 pointwise confidence intervals for 9t(q a t)- 
For the complete failure time data (i.e., c.r. = 0%), 6 t (q at ) and 9 t (q a t), computed using A op (, 
give separately a slight overestimate and underestimate of 6t{q a t)- Moreover, the variance 
E* (t, t) tends to be underestimated, which leads to a lower coverage probability. As expected, 
the bias and standard deviation of 6 t (q a t) will separately increase and decrease as the band- 
width becomes larger. It is also detected from these tables that the poor estimates of £ a (£, £)'s 
appear at extremely small or large bandwidths. In the numerical studies, the larger bias of 
#t(<7o.it) is found and the main reason for this is because it is computed via comparing only 
(at most) the top ten percent of subjects in the control group with those in the case group. 
For a = 0.3, the availability of data used for statistical analysis is expanded and, hence, the 
performance becomes better. The biases of the proposed estimators is indistinguishable in 
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the presence of heavy censoring, whereas the standard deviations and the standard errors will 
become larger. At each simulated sample, we can see that the estimators using X opt provide 
generally satisfactory results. 

The empirical coverage probabilities of pointwise confidence intervals (tables 1-3) show the 
good performance of bandwidths selected from the automatic selection procedure for interval 
estimation, except for (n, c.r.,a) of (500, 50%, 0.1) and (500, 50%, 0.2) at the time point t .6- 
However, most of the coverage probabilities are lower than 0.95 for the bandwidths of 0.01 and 
0.2. For samples without censoring, it is revealed from tables 1-3 that the empirical coverage 
probabilities of the pointwise confidence intervals computed based on 6 t (q at ) with X opt are 
more close to the nominal level of 0.95 than those based on 9 t {q a t)- Similar conclusions can be 
also drawn from table 4 for the simultaneous coverage probabilities. The empirical coverage 
probabilities of the simultaneous confidence band ( 12. 9p with X opt roughly stay around 0.95 
except for the wider interval [to.4;^o.6]- 

4 A Data Example - ACTG 175 Study 

In the ACTG 175 study (Hammer et al. (1996)), the classification accuracy of CD4 cell 
counts on the time in weeks from entry to AIDS diagnosis or death might depend on whether 
they received the prior antiretroviral therapy. A total of 2467 HIV-l-infected patients, which 
were recruited between December 1991 and October 1992, are considered. Of these patients, 
1395 received the prior therapy while the rest 1072 did not receive the therapy. During the 
study period, 308 patients died of all causes or were diagnosed with AIDS. For a negative 
association between CD4 counts and the time to AIDS and death, we let Y be a strictly 
decreasing function of the CD4 marker and T be the minimum of time-to-AIDS and time- 
to-death. Currently, there is still no standard of clinically meaningful values of FPR for the 
pAUC in AIDS research. In this data analysis, we restrict our attention to the pAUC of Y 
with the FPR less than 0.1 or 0.2 or 0.3. 

To simplify the presentation, the marker and the time-dependent pAUC for non-therapy 
and therapy patients are denoted separately by (Y^\ #t v9at )) an d (^ 2 \ Qt^iQat))- Based on 
two independent data sets {X?\ 5?\ >f and {XfU^lf^, and df\t}) 

are computed as 6 t (q at ) in (12. 3 \ using the bandwidths of 0.042 and 0.069, which are the 
minimizers of ISE(X) in (13. ip . The confidence intervals for 0^\q^y?> are further constructed 
from (12. 7p and (12. 9p . Due to the large variation of estimators before week 98, we only provide 

8 



the estimated time-dependent pAUCs and 0.95 pointwise and simultaneous confidence bands 
from week 98 to the end of study in figures 1 (a)-(f). Based on the summary measures 
0t^(<7oit)' s and compared with 0.005, one can conclude from the simultaneous confidence 
bands that the CD4 count is a useless biomarker in classifying patient's survival time within 
the considered time period for both therapy and non-therapy patients. However, a different 
conclusion will be drawn at each time point based on the pointwise confidence intervals. The 
time-dependent pAUCs 9^ (q^t ) an d 0[ (q^t ) are detected to be significantly higher than 
0.5a 2 for a=0.2 and 0.3 after week 110 and week 98, respectively. Figures 1 (a), (c), and (e) 
give a clear indication that the pAUCs decrease slightly over time for patients without prior 
therapy. However, the pAUCs stay very close to a constant throughout the study period for 
those with prior therapy (figures 1 (b), (d), and (f)). 

The difference in the classification accuracies of and can be measured by the 
summary index 7 Q (t) = Qt\'Zat) ~ ®\ ; (?at )■ When a — 1, 7«(t) is the usual comparison of 
AUCs in the time-dependent setting. It is natural to estimate 7 Q (t) by %,(£) = 9\ (q^J) — 
^t 2 \Qat)- Along the same lines as the proof in the Appendix, we can derive that \/ri(j a (t) — 
7 a (t)) converges weakly to a mean zero Gaussian process with variance-covariance function 
T a (s,t) = K-'E^is^it)] + (1 - n^El^is^it)] provided that n x /n k (0 < « < 
1) as n — (ni + n 2 ) — > oo, where (t) is a counterpart of \& a j(t), k = 1, 2. To make inference 
on 7 a (t), T a (s,t) is first estimated by 

f „(., t) = 4 E ®£? w®2 w + J E ®2 w®2 (*)• (4.D 

'''1 i '•'2 i 

1 «=1 z 1=1 

A (1 — <j) pointwise confidence interval for j a {t) an d a (1 — simultaneous confidence band 
for {7a (t) : t G [ti,t 2 ]} are separately given via 

Z i i 
%(t) ± -prl(t,t) and {%(t) ± -7=r»(t,t) : t G [n,r 2 ]} (4.2) 



with being obtained as (12. 8p . It is revealed in figures 2 (a)-(c) that j a (t), a=0.1, 0.2, 
and 0.3, tend to be positive within the study period and the difference becomes negligible 
as a increases. In other words, with small values of FPR t (y), a prior antiretroviral therapy 
might lower the discrimination ability of CD4 counts in classifying subject's t-week survival. 
One possible explanation for this conclusion is that the prior therapy makes patients more 
homogeneous in survival time and CD4 counts. The estimates are further found to be around 
zero after about week 160. It means that for long term survival classification the performance 
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of CD4 counts is irrelevant to whether patients receive prior therapy or not. Due to the 
large variability in the data, we could not detect any significant difference between 9[ {q^t ) 

(2) (2) 

and 9\ {q a t)- It would necessitate extremely large sample sizes to enable demonstration of 
significant differences between the pAUCs. 

5 Discussion 

For the time- dependent pAUC, it was traditionally estimated by the trapezoidal numerical 
integration method. The derivation for its sampling distribution becomes complicated and 
the computation load is prohibitively expensive. Although the inferences can be developed 
through a bootstrap technique, there is still no rigorous theoretical justification for this pro- 
cedure. We can see in this article that the proposed estimators are simple and have explicit 
mathematical expressions. The confidence bands are built based on the asymptotic Gaussian 
process of the estimators as well as the corresponding estimates of the asymptotic variances. 
The estimation and inference procedures are further shown to be useful through simulation 
studies and an application to the ACTG 175 data. 

It is detected from our numerical studies that the performance of the proposed estimator 
@t(q a t) is very sensitive to small value of a. To obtain a more stable estimate of 9 t (q a t), 
a large sample size relative to a is usually suggested especially in the presence of censoring. 
Moreover, the price for the assumption of marker dependent censoring is to find an appropriate 
bandwidth in estimation. To this problem, we propose a simple and easily implemented 
selection procedure and show its good performance through simulations. As for the estimation 
of t (q at ), it can be also derived via using the bivariate estimation methods of Campbell (1981) 
or Burke (1988) for S(t,y). However, these estimators are only valid under independent 
censorship which appears to be very limited and may not always be met in applications. 
One advantage of totally independent censoring assumption is that no smoothing technique 
is required. 

In some empirical examples, censored survival data of the form Y^ , Y^ }f = i are 

often occurred in a paired design with (Y^ , Y^) being the different biomarkers of the zth 
subject. The scientific interest usually focuses on comparing the discrimination abilities of 
and on subject's survival status at each time point within the study period. Ob- 
viously, the assumptions of marker dependent censoring made separately on (T, C, Y^) and 
(T, C, Y^) are often unreasonable in practice. Under a more flexible assumption of condi- 
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tionally independent censorship (T and C are independent conditioning on (Y^\Y^)), the 
estimated joint survivor function of T and and that of T and Y^ in this article are quite 
inadequate in the estimation of j a (t) without modification. It is worthwhile to investigate the 
associated comparison procedure in our future study. 

APPENDIX 

For the proof of main results, the assumptions in Akritas (1994) and the conditions (Al: 
ft(y) = -dFPR t (y)/dy exists with mi t f t (q at ) > 0) and (A2: sup t \q~ x {F P Rt{q at + q) - 
FPR t (q at )} + ft(q a t) \ — > as q — > 0) are made throughout the rest of this paper. 

Asymptotic Gaussian Process of 6 t (q at ): 

From Theorem 3.1 of Akritas (1994), one has 

i— n 

sup \yfr(S(t,y) - S(t,y)) -^Y. V ^)\ = °pV)> ( A ' X ) 

where V z (t,y) = (S T (t\Yi) + &(t))J(^ > y) - S(t,y) with £(t) = -5 T (t|F,) j^'H^) 
d u Mi{u\Yi), Mi(t\Yi) = I(Xi < t)5i + \nS T (t AX^Yi), and t AX { = min-^JQ}. Let hij{t,y) = 
(1 — ST(t\Yi))ST(t\Yj)<j)ij(y) and H(t, y) = E[hij(t, y)\. The uniform consistency of (t ] J/) (cf. 
Dabrowska (1987)) ensures that 

+- 2 - SrmmSTitlY,) - SritlYMMv) + r ln (t, y) (A.2) 

■ -L ■ 

«7=? 



with sup t \r\ n (t,y)\ = o p (n 1 ^ 2 ). By a direct calculation and (lA.lj) . a simplified form of the 



second term in the righthand side of (1A.2I) is obtained as follows: 

1 n i n 



n * — ' n 

j=i i=i 



ST(t\Y,Mt)My) + r 2n(t, y), (A.3) 



n 

1,3 



where sup t J/ \r 2n (t, y)\ = o p (n 1 ^ 2 ). Similarly, the third term can be expressed as 

1^(1- SrWtfZMfcjiy) + r Sn (t, y) (A.4) 



hi 



11 



with sup t |r 3n (t, y)\ = o p (n 1 ^ 2 ). It follows from 0A.2p - 0A.4p . the decomposition of a U- 
statistic into a sum of degenerate U-statistics (Serfling (1980)), and Corollary 4 of Sherman 
(1994) that 



sup \VE(H(t, y) - H(t, y ))-^J2 V)\ = °p(1), ( A - 5 ) 
t, y n 



1=1 



where Ufay) = Eft^y) + h^y^X^, 5,} - 2H(t,y) + {S Y {YJ ~ > 
y). By the Taylor expansion of 9 t (y) = H(t,y){S T {t)(l - SrC*))}" 1 at (H(t,y),S T (t)) = 
(H(t,y),S T (t)), OAT]) , and 0X5]) . one has 



sup |Vn(0 t (y) - 0t(y)) - — 2^ ^77771 — ^777^ ' = ( A - 6 ) 

t, y n o T [t){L - i3 T [t)) 

where rj(t, y) = H(t, y)(2S T (t) - l)(S T (t) -^(t))" 1 . Thus, y/n(9 t (q at ) - 9 t (q at )) can be shown 
to converge to a mean zero Gaussian process by an application of the functional cental limit 
theorem. 

For the asymptotic Gaussian process of 9 t (q at ), it is established through the equality 
\fn(6 t (q at ) - 6 t (q at )) = y/ri(9 t (q at ) - 9 t (q at )) + y/n(9 t (q at ) - 9 t (q at )). Let y/n(q at - q at ) = 
y/n(Q(S) — Q(S)) with Q : S — > q at . By assumptions (A1)-(A2), the Hadamard differentia- 
bility of Q is a direct result of Lemma A.l in Daouia, Florens, and Simar (2008). Together 
with the functional delta method (cf. Van der Vaart (2000)), we have 

ft ^ Qat) ~ 0iVi(t, -oo) 

i=l 

and the weak convergence of \fn{q at — q a t)- It is further ensured by (a version of) Lemma 
19.24 of van der Vaart (2000) and ([Oil that 



i ri~ \ V™ viy^qat -u^-ou;, 

sup \Vn (q at - q at ) ^ 77 \^7T\ = °p 1 ■ A ' 7 

t n f-' ft\qat)S T {t) 



sup |Vn(^(g Q i) - 6>t(g at ) - ^/n{9 t {q at ) - 9 t (q at ))\ = o p (l). (A. 8) 

i 

Moreover, the first order Taylor expansion of 9 t (q a t) at g Qf = g Qf , the continuity of d9 t (y)/dy, 
sup t \q at — q a t\ = o p (l), and the continuous mapping theorem imply that 



aS T (t) - S Y {q a t) 

"■f I V ' " V (- V'i cut y v l\iixi/j * q / ■ \ 

t 1 — OT(,tJ 



sup Iv^^tCgot) - 0t(<7at)) ^ o tt^ at ft{qat)Vn{q a t - q a t)\ = o p (l). (A.9) 



It follows from 0A~6]) -( TA~9]) that 

i— n 

sup \yfti(9 t {q at ) - 9 t (q*t)) - — Y] = o p (l), (A.10) 



n 

i=l 
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i ,t, u\ U i(^ + Qat)Vi(t, -oo) + (aS T (t) - S Y {q a t)){Vi{t, q at ) - aVi(t, -oo)) 

where * ai{t) = s T (t)(i-s T (t)) • 

Finally, the proof is completed by applying the functional central limit theorem to the ap- 
proximated term n -1 / 2 Y17=i in flA.lOp . 

REFERENCES 

Akritas, M. G. (1994). Nearest neighbor estimation of a bivariate distribution under random 
censoring. Annals of Statistics 22, 1299-1327. 

Burke, M. D. (1988). Estimation of a bivariate distribution function under random censor- 
ship. Biometrika 75, 379-382. 

Cai, T. and Dodd, L. E. (2008). Regression analysis for the partial area under the ROC 
curve. Statistica Sinica 18, 817-836. 

Campbell, G. (1981). Nonparametric bivariate estimation with randomly censored data. 
Biometrika 68, 417-423. 

Chambless, L. E. and Diao, G. (2006). Estimation of time- dependent area under the ROC 
curve for long-term risk prediction. Statistics in Medicine 25, 3474-3486. 

Chiang, C. T., Wang, S. H., and Hung, H. (2008). Random weighting and Edgeworth expan- 
sion for the nonparametric time-dependent AUC estimator. Statistic Sinica. Accepted. 

Chiang, C. T. and Hung, H. (2008). Nonparametric estimation for time-dependent AUC. 
Technical Report, National Taiwan University. 

Dabrowska, D. M. (1987). Uniform consistency of nearest neighbor and kernel conditional 
Kaplan-Meier estimates. Technical Report No. 86, Univ. California, Berkeley. 

Daouia, A., Florens, J. P., and Simar, L. (2008). Functional convergence of quantile-type 
frontiers with application to parametric approximations. Journal of Statistical Planning 
and Inference 138, 708-725. 

Dodd, L. and Pepe, M. S. (2003). Partial AUC estimation and regression. Biometrics 59, 
614-623. 

Dwyer, A. J. (1996). In pursuit of a piece of the ROC. Radiology 201, 621-625. 



13 



Emir, B., Wieand, S., Jung, S. H., and Ying, Z. (2000). Comparison of diagnostic markers 
with repeated measurements: a non-parametric ROC curve approach. Statistics in 
Medicine 19, 511-523. 

Hammer, S. M., Katzenstein, D. A., Hughes, M. D., Gundacker, H., Schooley, R. T., 
Haubrich, R. H., Henry, W. K., Lederman, M. M., Phair, J. P., Niu, M., Hirsch, M. 
S., and Merigan, T. C. (1996). A trial comparing nucleoside monotherapy with combi- 
nation therapy in HIV-infected adults with CD4 cell counts from 200 to 500 per cubic 
millimeter. The New England Journal of Medicine 335, 1081-1090. 

Heagerty, P. J., Lumley, T., and Pepe, M. S. (2000). Time-dependent ROC curves for 
censored survival data and a diagnostic marker. Biometrics 54, 124-135. 

Jiang, Y., Metz, C. E., and Nishikawa, R. M. (1996). A receiver operating characteristic 
partial area index for highly sensitive diagnostic tests. Radiology 201, 745-750. 

Lin, D. Y., Wei, L. J., Yang, I., and Ying, Z. (2000). Semiparametric regression for the mean 
and rate functions of recurrent events. Journal of the Royal Statistical Society B62, 
711-730. 

McClish, D. K. (1989). Analyzing a portion of the ROC curve. Medical Decision Making 9, 
190-195. 

Serfhng, R.J. (1980). Approximation theorems of mathematical statistics. New York: Wiley. 

Sherman, R. P. (1994). Maximal inequalities for degenerate U-processes with applications 
to optimization estimators. Annals of Statistics 22, 439-459. 

van der Vaart, A. W. (2000). Asymptotic Statistics. Cambridge: Cambridge University 
Press. 

Zhang, D. D., Zhou, X. H., Freeman, Daniel H. J., and Freeman, J. L. (2002). A non- 
parametric method for the comparison of partial areas under ROC curves and its appli- 
cation to large health care data set. Statistics in Medicine 21, 701-715. 

Zhou, X. H., McClish, D. K., and Obuchowski, N. A. (2002). Statistical methods in diagnostic 
medicine. New York: Wiley. 



14 



Table 1 

The averages (Mean) and the standard deviations (SD) of 500 estimates, the standard errors 
(SE), and the empirical coverage probabilities (CP) 



c.r. = 0% 






n = 500 








n = 1000 










Time 


H i i f ir\ -i , 1 


IVJA. c 111 






TP 


n/\ OQT1 


ou 


OJ_J 


TP 


vtyqo.xt) 


^0.4 

tr. r 

*0.6 


0.0264 
0.0279 


0.0273 
D D984 
0.0296 


0.0044 

U.UU^iu 

0.0046 


0.0041 
0.0045 


0.938 
n qi s 
0.914 


0269 

U.Ui i o 

0.0288 


0.0031 

U.UUOl 

0.0034 


0029 

U.UUOU 

0.0031 


0.944 
0.926 


Vt(qo.it) 
(A op t) 


^0.4 

J- 

*0.6 


0.0264 
U.Uzoy 
0.0279 


0.0240 

n no/in 
U.Uz4y 

0.0262 


0.0043 

n nn/i o 
U.UU4z 

0.0046 


0.0049 
U.UU4 1 
0.0046 


0.954 
U.yDU 
0.928 


0.0247 
U.Uzoo 
0.0269 


0.0030 
U.UUoU 
0.0033 


0.0034 

n nnQO 
U.UUoz 

0.0033 


0.948 
u.yoo 
0.942 


c.r. = 30% 






n = 500 








n = 1000 








A 


Time 


dt(qo.it) 


Mean 


SD 


SE 


CP 


Mean 


SD 


SE 


CP 


0.01 


toA 
to.5 
to.e 


0.0264 
0.0269 
0.0279 


0.0268 
0.0272 
0.0279 


0.0052 
0.0055 
0.0055 


0.0040 
0.0040 
0.0041 


0.834 
0.846 
0.848 


0.0264 
0.0272 
0.0280 


0.0034 
0.0038 
0.0040 


0.0030 
0.0030 
0.0031 


0.908 
0.874 
0.882 


Aopt 


tOA 
to.5 

to.e 


U.U264 
u.uzoy 
0.0279 


0.0234 
U.UZ40 

0.0264 


0.0050 
U.UUD4 

0.0055 


0.0058 
U.UUDD 

0.0056 


0.948 
u.yoz 
0.924 


n no A q 
0.0243 

u.uzdz 

0.0264 


0.0035 
U.UUo 1 

0.0039 


0.0041 
U.UU4U 

0.0040 


0.950 

n Q/19 
u.y4z 

0.910 


n 9n 
u.zu 


toA 
SO. 5 
*0.6 


0.0264 
u.uzoy 
0.0279 


0.0190 
U.uzuo 
0.0218 


0.0032 

U.UUo 1 

0.0045 


0.0062 

U.UUDl 

0.0061 


0.894 

U.oDo 

0.864 


0.0187 

n n9nn 
u.uzuu 

0.0218 


0.0021 
u.uuzu 
0.0031 


0.0045 
n nn/i/i 

U.UU44 

0.0045 


0.654 
u.uyo 
0.762 


c.r. = 50% 






n = 500 








n = 1000 








A 


Time 


dtiqo.it) 


Mean 


SD 


SE 


CP 


Mean 


SD 


SE 


CP 


0.01 


toA 

to.5 

to.e 


0.0264 
0.0269 
0.0279 


0.0266 
0.0268 
0.0265 


0.0062 
0.0064 
0.0066 


0.0042 
0.0041 
0.0043 


0.786 
0.772 
0.754 


0.0266 
0.0270 
0.0275 


0.0043 
0.0046 
0.0051 


0.0033 
0.0033 
0.0034 


0.864 
0.828 
0.792 


Aopi 


toA 

to.5 

to.e 


0.0264 
0.0269 
0.0279 


0.0237 
0.0250 
0.0266 


0.0057 
0.0061 
0.0071 


0.0067 
0.0064 
0.0064 


0.950 
0.918 
0.892 


0.0243 
0.0253 
0.0267 


0.0040 
0.0045 
0.0049 


0.0047 
0.0046 
0.0047 


0.960 
0.924 
0.920 


0.20 


tOA 
to.5 

to.e 


0.0264 
0.0269 
0.0279 


0.0192 
0.0209 
0.0225 


0.0037 
0.0047 
0.0056 


0.0073 
0.0071 
0.0072 


0.952 
0.932 
0.908 


0.0190 
0.0203 
0.0221 


0.0025 
0.0031 
0.0037 


0.0053 
0.0052 
0.0055 


0.828 
0.846 
0.864 



15 



Table 2 

The averages (Mean) and the standard deviations (SD) of 500 estimates, the standard errors 
(SE), and the empirical coverage probabilities (CP) 
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Table 3 

The averages (Mean) and the standard deviations (SD) of 500 estimates, the standard errors 
(SE), and the empirical coverage probabilities (CP) 
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Table 4 

The empirical coverage probabilities of 0.95 simultaneous confidence bands 
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(a) Without Prior Therapy (alpha=0.1) 



(b) With Prior Therapy (alpha=0.1) 
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Figure 1: The estimated time-dependent pAUCs (solid curve) and the 0.95 pointwise confi- 
dence intervals (dotted curve) and simultaneous confidence bands (dashed curve). 
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(a) alpha=0.1 




(b) alpha=0.2 
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Figure 2: The estimated curves for the difference of the time-dependent pAUCs between non- 
therapy and therapy patients (solid curve) and the 0.95 pointwise confidence intervals (dotted 
curve) and simultaneous confidence bands (dashed curve). 
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